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TITLE OF THE IN VENTTON 
Method and Apparatus for Decoding Compressed Video Signals 

BACKGROUND OF THE INVENTION 
Field of the Invention 
The present invention relates generally to decoding of compressed 
video Signals, and more specifically to a method and apparatus for decoding 
video signals such as MPUG-2 video in which video signals are coded 
according to a compression technique that supports field estimation motion 
compensation. The present invention is particularly suitable for applications 
10 where resolution is reduced for relieving the burden on processing signals. 

Descrip tion of the R elated Art 
Need often exists for displaying MPEG-2 video signals on a device 



Another approach is disclosed in a paper "A Drift Free Scalable 



transform) process is performed on a variable-length decoded, inverse- 
25 quantized signal to "downscale" its resolution. With this process, the lower 
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frequency components of the signal are exclusively used, with a resultant 




2 


decrease in the resolution of the signal. The decoded signal then undergoes a 




3 


motion vector compensation process, which is followed by a process in which 




4 


motion compensation is performed by halving the motion vectors. However, 




5 


since the information contained in one of the fields is lost, it is Impossible to 




6 


meet the requirements specified by the MPEG-2 standard only with the use of 




7 


the field estimation technique. 


o 


8 


In order to solve this problem, Japanese Patent Publication P2000- 


if*** 


9 


59793A discloses a video decoding technique for both progressive and 




1(1 


interlaced video signals. The known technique uses a coding scheme known 




11 


as the frame DCT mode in which the video signal is encoded by treating its 


M 


12 


top and bottom fields as a single unit. During decoding, the compressed 




13 


video is first subjected to an inverse frame-DCT process and then the top and 


w 


14 


bottom fields are separated from each other, The separated fields are 


Q 


15 


individually subjected to an inverse fleld-DCT process in which low 




16 


frequency components are exclusively used to downscale the resolution of 




17 


the signal. Although much of the field information can be retained, the prior 




18 


art requires a substantial amount of computations, which would impose a 




19 


great burden on software-driven systems such as personal computers. 




20 


SUMMARY OF THE INVENTION 




21 


It is therefore an object of the present invention to provide a method 




22 


and apparatus for decoding a compressed video signal by downswing its 




23 


resolution with reduced computations and high-speed capability. 




24 


According to a first aspect of the present invention, there is provided a 




25 


video decoding method for decoding a coded picture by a reference picture, 
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1 wherein the coded picture contains first and second fields and the reference 

2 picture exclusively contains a first field. The method comprises exclusively 

3 decoding the first field of the coded picture whereby thp decoded picture 

4 contains motion vectors and determining whether the first field of the 

5 reference picture or a nonexistent second field of the reference picture is 

6 referenced. If the first field of the reference picture is determined to be 

s s 7 referenced, a motion compensation is performed by using the motion vectors. 

W S If the nonexistent second field of the reference picture is determined to be 

H 9 referenced, the motion vectors are corrected so that thev extend from the first 



gj 10 field of the reference picture to the decoded first field and a motion 

11 compensation is performed by using the corrected motion vectors, 

12 According to a second aspect, the present invention provides a video 

13 decoding method for decoding a coded picture by Ubing a reference picture, 



UJ 14 wherein the coded picture contains first and second fields and the reference 

o 

U. 15 picture exclusively contains a first field, the method comprising exclusively 

1 fi decoding the first field of the coded picture whereby the decoded picture 

17 contains motion vectors and determining whether field estimation or frame 

1 8 estimation ii> to be used for motion compensation. If the field estimation is 
10 determined to be used, the first field of the reference picture or a nonexistent 

20 second field of the reference picture is determined to be referenced, If the 

21 first field of the reference picture ia> determined to be referenced, a motion 

22 compensation is performed by using the motion vectors. If the nonexistent 

23 second field of the reference picture is determined to be referenced, the 

24 motion vectors are corrected so that they extend from the first field of die 

25 reference picture to the decoded first field and a motion compensation is 
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1 performed by using the corrected motiorL vectors. If the frame estimation is 

2 determined to be used, average values of successive lines of the first field of 

3 the reference picture are calculated, and the motion vectors are corrected 

4 using the average values and a motion compensation is performed by using 

5 the calculated motion vectors. 

6 According to a third aspect of the present invention, there is provided 

7 a video decoding method for decoding a coded picture by using a reference 



y 8 picture, wherein the coded picture contains first and second fields and is 

o 



9 structured as field picture or frame picture, and the reference picture 

10 exclusively contains a first field, the method comprising the steps of (a) 

11 exclusively decoding the first field of the coded picture whereby the decoded 

12 picture contains motion vectors, (b) determining whether the decoded first 

f* 13 field is structured as field picture or as frame picture, (c) if the decoded first 

IP* 

W 1 4 field is determined to be structured as field picture, determining whether the 

U 

H 15 first field of the reference picture or a nonexistent second field of the 

16 reference picture is referenced, (d) if the first field of the reference picture is 

17 determined to be referenced, performing a motion compensation by using the 

18 decoded motion vectors, (e) if the nonexistent second field of the reference 

19 picture is determined to be referenced, correcting the decoded motion vectors 

20 so that they extend from the first field of the reference picture to the decoded 

21 first field and performing a motion compensation by using the corrected 

22 motion vectors, (f) if the decoded first field is determined to be structured as 

23 frame picture, determining whether field estimation or frame estimation is to 

24 be used, and (g) if the field estimation is determined to be used, repeating 

25 steps (d) and (e), and if the frame estimation is determined to be used, 
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1 calculating average values of successive lines of the first held of the reference 

2 picture, calculating motion vectors using the average values and performing 

3 a motion compensation by using the calculated motion vectors. 

4 According to a fourth aspect of the present invention, there is 

5 provided an apparatus for decoding a coded picture by using a reference 

6 picture, wherein the coded picture contains first and second fields and the 

7 reference picture exclusively contains a first field. The apparatus comprises 

8 decoding circuitry for exclusively decoding the first field of the coded 

9 picture, motion compensation circuitry, and motion vector correction 

10 circuitry, and control circuitry, The control circuitry provides the functions of 

11 for causing the motion compensation circuitry to perform a motion 

12 compensation by using motion vector* decoded by the decoding circuitry if 

13 the rirst field of the reference picture is referenced. Further, the control 

14 circuitry causes the motion vector correction circuitry lo correct the decoded 

15 motion vectors so that they extend from the first field uf the reference picture 

16 to the decoded first field and causes the motion compensation circuitry fco 

17 perform a motion compensation by using the corrected motion vector* if the 
IS nonexistent second field of the reference picture is referenced. 

19 According to a fifth aspect, the present invention provides an 

20 apparatus for decoding a coded picture by using a reference picture, wherein 

21 the coded picture contains first and second fields, and the reference picture 

22 exclusively contains a first field. The apparatus comprises decoding circuitry 

23 for exclusively decoding the first field of the coded picture, motion 

24 compensation circuitry; motion vector correction circuitry, averaging 

25 circuitry, and control circuitry. The control circuitry performs the functions 

26 of causing the motion compensation circuitry to perform a motion 

27 compensation by us>ing the decoded motion vectors if the first field of the 

28 reference picture is referenced. If the nonexistent second field of the 
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1 reference picture is referenced, the control circuitry causes the motion vector 

2 correction circuitry to correct the decoded motion vectors so that the 

3 corrected vectors extend from the first field of the reference picture to the 

4 decoded first field and causes the motion compensation circuitry to perform a 

5 motion compensation by using the corrected motion vectors. If frame 

6 estimation is used, the control circuitry causes the averaging circuitry to 

7 calculate average values of successive lines of the first field of the reference 
S picture, causes the motion vector correction circuitry to correct the decoded 

9 motion vectors by using the average values and causes the motion 

10 compensation circuitry to perform a motion compensation by using the 
y l i corrected motion vectors. 

12 BRIEF DESCRIPTION OF THE DRAWIGNS 

13 The present invention will be described in detail further with reference 
jU 14 to the following drawings, in which: 

j s j. 15 Fig. 1 is a block diagram of a video decoder according to the present 

^ 16 invention; 

17 Fig. 2 is a flowchart of the operation of die video decoder; 

18 Fig. 3 is an illustration of a macroblock which is structured as field 

19 picture and coded in a field DCT mode for storage and then decoded 

20 according to the present invention; 

21 Fig. 4 is an illustration of a macroblock which is structured as frame 

22 picture and coded in a frame DCT mode for storage and then decoded 

23 according to the present invention; 

24 Fig. 5 is an illustration of video lines for extending a motion vector 

25 when field estimation is used; and 

2ft Fig. 6 is an illustration of video lines for yielding average values of 
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1 top-field lines to substitute for nonexistent bottom field lines when frame 

2 estimation is used. 

* DETAILED DESCRIPTION 

4 In Fig. 1, a video decoder according to the pr&enl invention is 

5 illustrated. A compressed video signal encoded according to the MPEG-2 

6 standard is stored in a memory device 101. MPEG-2 video deals with objects 

7 that are used to structure video information. A picture is one of the objects 

8 defined by the standard. Each picture is divided into a number of blocks of 8 

9 x 8 pixels each, which are grouped into macroblocks. Four blocks of 

10 luminance values plus a number of blocks with chrominance values form a 

11 macroblock. Three picture types are defined in MPEG-2 video: Intra-coded 

12 pictures (T-pictures), predictive coded pictures (P-picturcs) and 

13 bidirectionally predicted pictures (B-pictures). The pictures are either coded 

14 as "field pictures" to handle interlaced video display modes, or "frame 



W 
Q 

H 15 pictures" to handle non-interlaced display modes. Two field pictures always 



16 occur in a pair, one containing the top field, the other containing the bottom 

17 field of the complete frame. Held pictures are used if they contain motion 
1 H and not SO much detail. Frame pictures, on the other hand, are formed by 
19 first combining the top and bottom fields to form a frame, which is then 

2u encoded. Frame pictures are used if there is a lot of detail in the picture and 

21 only limited motion. A video sequence represents a number of such pictures 

22 which occur in a predetermined order. Since the I-picture in a video 

23 sequence is encoded in such a way that it can be decoded without knowing 

24 anything about other pictures in the video sequence, the first picture in that 

25 sequence is an 1-picture and serves as a reference picture. 
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1 In a video sequence, the P-picture is decoded by using information 

2 from the reference picture (I-picture) of the video sequence which was 

3 displayed previously. The P-picture contains information that cannot be 

4 burrowed from the reference picture. Such information is coded in the same 

5 way I-pictures are coded. Therefore, the P-picture contains intra-coded 

6 macroblocks (Tmacroblocks) which can be used as reference information, and 

7 predictive coded macroblocks (P-macroblocks). Two B-pictures occur in each 

8 video sequence, Each B-picture uses information from a previous picture and 

9 a future picture of the video sequence. As in P-pirtures, picture information 

10 that cannot be found in previous or future pictures is intra-coded as I- 

11 macroblocks. 

12 The signal stored in the input buffer 102 is read on a per-picture basis, 

13 for example, into an input buffer 1 02, where it is fed to a decoder 103 and a 

14 controller 104 on a per-macroblock basis. Controller 104 is a microprocessor - 

15 based controller which reads programmed instruction data from a s>turage 

16 medium 150 such as a removable disk. Decoder 103 is set in one of two 

17 decoding modes (frame OCT mode or field DCT mode) according to a 

18 command signal from the controller 104 and performs decoding exclusively 

19 on top-field lines of the signal when enabled by the controller. The decoding 

20 process includes variable length decoding, inverse quantization and inverse 8 

21 v8 DCT (discrete cosine transform) decoding, Thus, the controller 104 needs 

22 to identify the macroblock whether it is structured as frame picture or as field 

23 picture. If the macroblock is identified a* a frame picture the decoder 103 is 

24 set in the frame DCT mode. Otherwise, the decoder is set m the field DCT 

25 mode, Additionally, the decoder 103 supplies decoded motion vectors to the 
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1 controller 104, 

2 A previous reference buffer 105 and a future reference buffer 106 are 

3 connected to the output of the decoder 103 for storing decoded macroblocks 

4 to form previous and future reference pictures, respectively, in response to a 

5 write command from the controller 104. The output of decoder 1U3 is further 

6 coupled to an output buffer 112 for copying data that can be directly 

7 displayed such as I-pictures and I-macroblocks contained in either B-picrures 
O 8 and P-pictures. 

0 The data stored in the reference buffers 1 0S and 1 06 are read by the 

10 controller 104 when the current macroblock uses information from previous 

11 or future picture. As will be described in detail, when the current macroblock 



s 

m 



* 12 is of a frame picture type, the controller 104 requests a motion vector 



H 13 corrector 107 to correct decoded motion vectors by using the outputs of 

y 14 reference buffers 105, 106. During a field estimation mode, the controller 104 

»T 15 commands the motion vector corrector 107 to correct the decoded motion 

16 vectors by extending their starting points to the first field for coupling to 

17 motion compensation circuitry 1 11, to which the outputs of decoder 103 and 
15 reference buffers 105, 106 are also applied through lines 109 and 110. An 

19 averaging circuit 1 08 is provided to respond to a command signal from the 

20 controller 104 for producing an average value of successive lines of the top 

21 field of a reference picture to estimate its nonexistent (virtual) bottom field 

22 during a frame estimation mode. Further, the averaging circuit 1 08 produces 

23 an average value of successive lines of the top field of the current picture to 

24 estimate its nonexistent bottom field during the frame estimation mode. The 

25 output of the average circuit 108 is cuupled lo the motion vector corrector 107 
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1 tu recalculate molion vectors. 

2 According to a command signal from the controller J 04, the motion 

3 compensation circuitry 111 selects its input signals on which to perform 

4 motion compensation and delivers a restored signal to the output buffer 112. 

5 The data stored in the output buffer 112 will be read into a display 

6 device 1 1 3 when a complete picture is restored and displayed. 

7 The operation of the controller 104 along with its associated elements 
S will be understood with the following description of a routine shown in the 

N 9 flowchart of Fig. 2. 

10 The routine begins with step 201 where the controller commands the 

11 input buffer 102 to read a picture from the memory device 101, At step 202, a 
* ; . 12 macroblock is read out of the buffer 102 intu the decoder 103 and the 

13 controller 104. Controller 103 examines the structure of the macroblock and 

W 14 identifies whether it is structured as frame picture or a field picture and sets 

U 15 the decoder 103 in one of the two decoding modes according to the identified 

16 structure (step 203) and enables the decoder to proceed decoding on the top- 

17 field lines of the macroblock (step 2U4). 

18 A macroblock (16 x 16 pixels) is formed in field picture as shown in 

19 Fig. 3, in which odd-numbered lines are separated from even-numbered lines 

20 lo form two 8x8 top-field blocks and two 8x8 bottom-field blocks, which 

21 are coded prior to storage in the memory device 101. Decoder 103 provides 

22 field decoding exclusively on the top-field blocks, leaving the bottom -field 

23 blocks entirely non-decoded. 

24 In a similar way, a macroblock is structured in frame picture as shown 

25 in Fig. 4 and coded prior to storage In the memory device 101, and then read 
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1 from ihe device 101 and proceed by the decoder 103. Odd numbered lines 

7 (top-field lines) and even -numbered lines (bottom-field lines) occur at regular 

3 intervals and divided into four blocks of 8 x 8 pixels each. Decoder 103 

4 performs the frame decoding only on the top-field lines, leaving the bottom - 

5 field lines non-decoded. 

6 Tn the coding process, a search made through previous and future 

7 pictures for a macroblock that matches or closely matches the current 
» macroblock, If such a macroblock is found, the difference between this 
9 macroblock and the current macroblock is determined- The resulting 

10 difference is the motion vector, which is first DCT coded and then variable 

Ct 11 length coded together with the motion vector of the macroblock. 

12 The single-field decoding lowers the resolution of the picture but 

13 significantly relieves the burden of computations. 

14 At decision step 205, the controller 103 examines the decoded 

15 macroblock to identify it as 1-picture, R-picturp or P-picture. If the 

16 macroblock is identified as I-picture, flow proceeds from step 205 to step 208 

17 to copy the macroblock to the previous reference buffer 105, and thence to 

18 step 217 to store the copy to the output buffer 112. If the macroblock is 

19 identified as B-picture, flow proceeds to step 209 to check to see if it is an I- 

20 macroblock of B-picture. If this is the case, the controller branches out to step 

21 217 for storing the copy of the macroblock In the output buffer. If decision 

22 step 205 identifies the macroblock as P-picture, flow exits to step 206 to 

23 determine if it is an I-macroblock of P-picture. If so, the controller proceeds 

24 to step 207 to copy the macroblock to the future reference buffer 106 and 

25 further moves to step 217 to store the copy in the output buffer 112. 
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i 


If the decision is negative at either step 209 or 206, the controller 




2 


recognizee that it needs some information from reference buffers and then 




3 


determineb which picture or pictures are required based on the transmission 




4 


order of pictures as specified by the MPEG-2 standard. As a result, the 




5 


controller reads, at step 210, field data of one or more stored pictures from the 




6 


buffers 105 and 106 into the motion compensation circuitry 111, depending 




7 


on whether the current macroblock is B-picturc or P-picture. 


M 

? s 


8 


At step 21 1, the controller re-examines the structure of the macroblock 


9 


as it did at step 203. If the current macroblock was identified as Held picture, 


5 

19 


10 


the controller proceeds to decision step 212 to determine which field of the 


fc. : 


11 


reference picture to be referred to for motion compensation of the current 


Sa4 


12 


macroblock. 




13 


Before proceeding with the description of motion compensation, it is 




14 


appropriate to discuss the ''field estimation" and "frame estimation" 




15 


currently used in the state-of-the art. In the field estimation mode, field 
✓ 




16 


information is used as a basic unit for individually estimating the motion of a 




17 


macroblock by referring to either of the top and bottom fields of a reference 




18 


picture. Therefore, a bottom field of one or more reference pictures may be 




19 


referred to when a top field is estimated and a top field of one or more 






rpfpYPTirp rrirtnrps niav bp referred to when a bottom field estimated. In the 




21 


frame estimation method, a top field is estimated by reference to a top field of 




22 


one or more reference frames and a bottom field is estimated by reference to a 




23 


bottom field of one or more reference frames. 




24 


If the controller determines, at step 212, that the top-field is to be 




25 


referenced, flow advances to step 214 to command the motion compensation 
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1 circuitry 111 to perform a motion compensation using the motion vectors 

2 decoded at step 203. More specifically, the motion compensation circuitry 

3 u^es the decoded motion vectors to identify a macroblock in the retrieved 

4 reference field, and combines the identified reference macroblock with the 

5 current macroblock data (i.e., the decoded difference) to restore the original 

6 macroblock (at step 215). 

7 If the controller determines that the bottom- field lines of the reference 

hit 

O 8 picture are to be referenced, flow proceeds from step 212 to step 213. Since 

H 9 the bottom-field data are not decoded in the present invention, the data to be 

m 10 referenced is nonexistent in both of the reference b uff ers 105 and 106 when 

11 the controller proceeds from step 212 to step 213, In this case, the controller 

12 directs the motion vector corrector 107 to correct the decoded motion vectors 

13 "y" that extend from the virtual bottom field of the reference picture to the 

14 current picture by extending their starling points to the top field of the 

15 reference picture and calculating the extended vectors. 

16 Fig. 5 explains details of the calculation of the extended motion vector 

17 performed by step 213. If the time interval between the top and bottom fields 
1$ of each picture is "t", the time interval between the bottom field of the 

19 reference picture and the top field of the next picture is also "t". The 

20 reference and current pictures are thus separated from each other by an 

21 interval n x t, where n is an integer. In Fig. 5, the decoded motion vector "y" 

22 represents the motion of a pixel PI on the top field of the current picture in 

23 the direction of vertical scan (line-by-line) from a virtual (nonexistent) pixel 

24 P2 on the bottom field of the reference picture, a vector "Y" represents the 

25 motion of the pixel PI from an extended point of the vector "/" lhal intersects 
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1 the top field of Ihe reference picture. The extended motion vector "Y" is 

2 calculated as follows: 

3 Y-(n + l)(y/n) (1) 

4 if Y is not: an integer, an internally divided value of the distance between 

5 successive lines is used instead of the non-integral number. Thus, when step 

6 212 determines that the bottom field is to be used as a reference, the controller 

7 commands the motion vector corrector 107, at step 213, to calculate Equation 

8 (1) to obtain extended motion vectors. The extended motion vectors are used 

9 at step 214 to identify a reference macroblock in the reference field and the 

10 current macroblock is combined with the reference macroblock at step 215. 

11 In addition, if the current macroblock is used as a reference picture, the 

12 values of successive lines of the top field of the current macroblock are 

13 averaged to construct motion vectors for the virtual bottom field of the 

14 current macroblock, as indicated by dotted lines in Fig, 5. In this case, such 

15 motion vectors are used by the motion compensation steps 214, 215 to 

16 reconstruct the bottom field and stored in one of the reference buffers. 

17 If, at step 211, the current macroblock is identified as frame picture, 

18 fluw proceeds to step 219 where the controller 104 determines which of two 

19 estimation methods (field estimation and frame estimation) to he used for 

20 motion compensation, depending on the motion vectors decoded and 

21 supplied from the decoder 1 03 at step 203. 

22 If step 219 determines that the field estimation method be used, the 

23 controller proceeds to step 212 to perform the same procedures as it did on 

24 the macroblock of field picture. If the frame estimation is to be used for 
2=> motion compensation, the controller proceeds to step 220 to instruct the 
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1 average circuit 108 to calculate average values of successive lines of the top 

2 field of the reference picture as shown in Fig. 6 and uses the average values as 

3 substitutes fur the virtual bottom field of the reference picture. Flow 

4 proceeds from step 220 to step 221 to calculate motion vectors that extend 

5 from the estimated bottom-field pixels of the reference picture to the top field 

6 of the current picture. In addition, if the current macroblock is used as a 

7 reference picture, the values of successive lines of the top field of the current 

8 macroblock are averaged to construct motion vectors for the virtual bottom 

9 field of the current macroblock, as indicated by dotted lines in Fig. 6. In this 

10 case, such motion vectors are used by the motion compensation steps 214, 215 

11 lo reconstruct the bottom field and stored in one of the reference buffers. 

12 Step 221 is followed by step 214, where the calculated motion vectors 

13 of step 220 arc used to identify a reference macroblock in the reference field 

14 to be combined with the current macroblock at step 215. 

15 If the restored macroblock obtained at step 215 is to be used as a 

16 reference picture (step 216), a copy of the restored macroblock is supplied 

17 from the motion compensation circuitry 111 through line 113 and stored in 

1 R one of the reference buffers depending on the position of the current picture 

19 in the video sequence (step 222) and further stored in the output buffer for 

2U display (step 217). If the decision is negative at step 216, the combined data is 

21 copied to the output buffer (step 217) and not copied to the reference buffers, 

22 Step 217 is followed by decision step 218, where the current 

23 macroblock is checked to see if the end of picture is reached. If not, the 

24 controller returns to step 202 to read the next macroblock from the input 

25 buffer. If the end of picture is reached, the controller returns to the starting 
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1 point of the routine to read the next picture from the memory device 1 01 to 

2 the input buffer 102. 
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